Transgenic Animals for Studying Regulation of Genes 

Background of the Invention 

Vascular endothelial cell (EC) activation is one of the first events in inflammatory 
conditions and transplant rejection. A specific marker of EC activation is E-selectin. 
5 Expression of this gene is generally absent under resting conditions and induced by 
inflammatory stimuli, as well as being NF-kB dependent. Expression is restricted to the 
endothelium. 

Immunohistochemistry is a standard technique by which signalling events involved 
10 in disorders associated with inflammatory, thrombotic, ischaemic or neoplastic conditions 
or transplant rejection, can be assessed; and, for example, E-selectin expression can be 
readily studied by this method in vitro . However, monitoring of E-selectin expression in 
vivo has been limited to immunohistochemistry of biopsies or post-mortem specimens. 
There has been a need by workers in the art for an in vivo model allowing assessment of 
1 5 E-selectin expression in a live subject. 

Summary of the Invention 

The present invention relates to an animal model useful for screening potential 
therapeutic agents for the treatment of disorders associated with inflammatory, thrombotic, 
20 ischaemic or neoplastic conditions or prevention of transplant rejection. In particular, the 
present invention provides a novel model in which endothelial cell activation can be 
assessed in vivo by a simple and sensitive enzyme assay. 

Brief Description of the Drawings 
25 Figure 1 (top panel) schematically illustrates the strategy which is pursued in 

Examples 1 and 2 for targeted replacement of the native mouse E-selectin allele with 
secreted alkaline phosphatase (SEAP). The SEAP-1 gene, or the SEAP-2 gene fused to 
the SV40 late mRNA polyadenylation site, is inserted into the mutagenized start site (ATG) 
of the E-selectin gene, together with a Neo R/loxP fragment. Figure 1 (bottom panel) 
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schematically depicts the SEAP-1 and SEAP-2 messenger RNAs. In all of the figures, 
genetic constructs are conventionally depicted with the 5' terminus at left. 

Figure 2(a) ("Endogenous E-selectin locus.") schematically depicts the E-selectin 
5 targeting constructs prepared in Examples 1 and 2 for carrying out homologous 

recombination in murine embryonic stem cells. Said constructs comprise the SEAP-1 
gene (or the SEAP-2 gene fused to the SV40 late mRNA polyadenylation site) inserted into 
the mutagenized start site (ATG) of the E-selectin gene, together with a NeoR gene 
flanked by loxP sites. Figure 2(b) ("Targeted locus.") depicts the heterologous E-selectin 
10 allele (the "targeted locus") (also referred to as the "knock in allele") resulting from 

integration of the targeted construct into the murine chromosomal E-selectin allele. Figure 
2(c) ("Targeted locus after Cre-mediated neo cassette removal.") shows the targeted locus 
following Cre-mediated removal of the neo casette and one of the LoxP sites. 

15 Figure 3 ("Induction of SEAP by cytokines or LPS") depicts elevated plasma levels 

of SEAP-1 (left panel) or SEAP-2 (right panel) in TNF- or LPS- induced heterozygous 
lines, with maximum levels of plama SEAP attained at 6 hours (SEAP-1) or 24 hours 
(SEAP-2) post-stimulation; and by comparison the absence of induction in NaCI or IFN 
controls. 

20 

Figure 4 ("Analysis of SEAP expression in the two lines") is a bar chart representing 
pooled data from n experiments which shows elevated plasma levels of SEAP-1- and 
SEAP-2 (left panel) and fold stimulation (right panel) in transgenic heterozygous (+/-) mice 
at 6 hours following LPS or TNF induction, compared to baseline levels. 

25 

Figure 5 ("Comparison of heterozygous and homozygous animals.") is a bar chart 
representing pooled data from n experiments which shows elevated plasma levels of 
SEAP-1 (left panel) and SEAP-2 (right panel) in transgenic homozygous TNF-induced lines 
relative to transgenic heterozygous lines. 

30 

Figure 6 shows the localization of SEAP expression by alkaline phosphatase 
histochemistry performed on cryosections from tongues of SEAP-2 mice 6 hours after 
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saline ("SEAP-2 control") or LPS injection ("SEAP-2 LPS"). The panels marked "LPS" 
show the immunolocalization of E-selectin in LPS treated mice. Arrows point to stained 
vessels. 

Figure 7 charts the elimination half-life of SEAP in TNF- or LPS- treated SEAP-1 
animals in relation to the elimination curve of i.v. administered alkaline phosphatase. 

Figure 8 ("Inhibition of SEAP expression by proteasome inhibitor.") illustrates 
inhibition of SEAP expression in an SEAP-1 line by a proteasome inhibitor, clasto- 
lactacystine-p-lactone, as compared to placebo, and placebo and TNF. 

Detailed Description 

The present invention provides in a first aspect a transgenic non-human animal 
wherein expression of an exogenous soluble marker specifically by endothelial cells, e.g., 
vascular endothelial cells, is regulatable by a chemical stimulus, including a cytokine, or a 
physical stimulus including light or temperature. 

In particular, the invention comprises a transgenic non-human animal comprising a 
polynucleotide encoding a soluble marker protein functionally linked to a regulatory 
sequence of an endogenous gene encoding E-selectin. 

The soluble marker protein is preferably an exogenous protein, Le. which is not 
normally produced by the animal. Alternatively, the protein may be produced by the animal 
but is not normally under the control of a regulatory sequence of an endogenous gene 
encoding E-selectin. The protein is selected to be detectable in plasma or blood samples 
by standard experimental means. 

Transgenic non-human animals include animals, e.g., mammals, e.g., pigs, cows, 
goats, as well as non-human primates, such as monkeys, and rodents, such as mice and 
rats, into which a suitable DNA construct has been introduced directly, as well as progeny 
of such animals still retaining said construct. Examples for useful animal lines include any 
animal line normally kept as laboratory animals. This invention shall also be understood to 
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include in all its aspects somatic recombinant non-human animals. The term "transgenic" 
refers in general to animals having an exogenous gene (i.e. non-native gene, Le. trans- 
gene), such as a marker gene, in the chromosomal DNA of their germ line; and the term 
"somatic recombinant" refers to animals having an exogenous gene, such as for example a 
5 marker gene, in the DNA of at least a portion of their somatic cells. 

Soluble marker proteins of the invention include proteins with or without enzymatic 
activity which are conveniently secreted either naturally or as a result of genetic 
manipulation into the extracellular space. Genes encoding suitable plasma-soluble, non- 
10 toxic marker proteins are known (e.g., human growth hormone), commercially available 

(e.g., a modified, secreted heat stable alkaline phosphatase), or may be prepared following 
established routes of gene identification, isolation, amplification and modification, e.g., 
ligation. 

15 In preferred embodiments, the soluble marker transgene of the invention is a 

secreted form of heat-stable human placental alkaline phosphatase gene lacking the 
membrane anchoring domain, referred to herein as "SEAP"(for "secreted alkaline 
phosphatase"). Of course, the introduced marker gene need not be limited to the SEAP 
gene, but may also comprise one or more soluble marker proteins having additional 

20 reporter functions. 

In its more particular aspects, the invention contemplates a transgenic (or somatic 
recombinant) non-human animal having inserted in its genome a soluble reporter gene, 
wherein the soluble reporter gene is under the control of the promoter of an E-selectin 
25 gene of a chromosomal E-selectin allele of said animal. 

The inserted soluble marker gene can be placed at a variety of positions within the 
genomic E-selectin sequence, provided that its expression remains under the control of the 
endogenous E-selectin promoter. 

30 

The exogenous soluble marker gene is preferably inserted into the genome of the 
subject animal by means of a "targeting construct" comprising the marker gene under the 
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control of the native E-selectin promoter of the animal to be rendered transgenic or a 
somatic recombinant, and optional additional transgenes. 

Preferably, the targeting construct is also constructed so that the marker gene or 
5 other transgene is inserted in a manner to decrease or prevent (Le. "knock out") 
expression of the native E-selectin structural gene. 

Most preferably, the genetic construct is inserted at the E-selectin allele 
transcription initiation site, at the E-selectin allele translation initiation site, or at any 
10 position in between. 

The targeting construct is generally inserted into the host animal genome by 
homologous recombination into the wild-type chromosomal E-selectin allele, and therefore 
can be engineered to recombine at any of a variety of positions in the E-selectin allele so 
15 long as the inserted marker gene is positioned such that expression of the soluble marker 
gene is under the control of the E-selectin promoter. 

In a preferred embodiment, the E-selectin gene of the non-human animal is 
modified, as a result of the homologous recombination, by insertion of the genetic 
20 construct at a site which is between the transcription and translation initiation sites of the 
native E-selectin structural gene of the animal, thereby placing expression of the soluble 
marker gene under the control of the native E-selectin promotor, while "knocking out" 
normal expression of the E-selectin structural gene. 

25 The resulting allele, comprising the inserted soluble marker sequence under the 

control of the native E-selectin promoter, and optional additional transgenes, is often 
referred to as a "knock-in" allele. 

The invention contemplates a transgenic animal that is either homozygous or 
30 heterozygous for the knock-in allele. 
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In its most preferred embodiment, the invention consists of a transgenic mouse that 
is homozygous or heterozygous for an E-selectin allele having inserted therein a genetic 
construct comprising a soluble marker gene under the control of the E-selectin promoter, 
wherein said inserted genetic construct diminishes or prevents expression of the native E- 
5 selectin structural gene of that allele. Such a mouse is commonly referred to in the art as 
a "knock out" mouse. 

The targeting construct is preferably constructed so as to assist in homologous 
recombination of the synthetic sequence with the endogenous gene; and accordingly, will 
10 also preferably comprise at least elements of the native structural E-selectin gene. 

As indicated above, the targeting construct may comprise additional components 
such as one or more additional reporters, selection markers such as a neo resistance 
cassette, and the like. (Upon insertion into the cellular genome of the subject animal, 
15 certain selection markers may be excised, for example, by Cre-mediated neo cassette 
removal.) 

Recombinant DNA constructs of the present invention may be prepared according 
to procedures known in the art, starting from, e.g., a genomic clone. A gene encoding E- 
20 selectin and its regulatory sequence may be identified and amplified following procedures 
known in the art, e.g.. by using appropriate primer pairs. Whether one or the other of the 
methods known in the art for the preparation of the DNA constructs is applied may depend 
on their intended further use. 

25 The targeting construct of the invention is preferably prepared from an isolated 

genomic clone of the native E- selectin gene of the subject animal by inserting the marker 
gene at a suitable position (1) to be under the control of the E-selectin promoter, and (2) to 
reduce or prevent normal expression of the native structural E-selectin gene. 

30 Preferably, the marker gene itself is inserted between the transcription and 

translation initiation sites of the isolated genomic E-selectin clone so as to "knock out" 
normal expression of the endogenous functional E-selectin gene. 
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In a particular embodiment which is illustrated in Example 1, step (a)(v), the 
targeting construct comprises, in order from 5' to 3': the native murine E-selectin 
promoter; a fragment of the native murine structural E-selectin gene comprising exon I; an 
SEAP-encoding component (comprising a Kozak consensus translation initiating signal, 
the SEAP coding sequence, and an SV-40 late mRNA polyadenylation signal) cloned into 
the mutated translational start site, Mfe I, of the murine E-selectin gene in exon II; a neoR 
cassette (comprising a loxP site; a neoR cassette, and a second LoxP site) cloned into 
intron II; and an additional fragment of the structural E-selectin gene comprising exons III, 
IV and V. A species of such a targeting construct has SEQ. ID. NO:9. 

In another embodiment of a murine SEAP-targeting construct which is illustrated in 
Example 2, step (a)(ii), the targeting construct comprises, in order from 5' to 3': the native 
murine E-selectin promoter; a fragment of the native murine structural E-selectin gene 
comprising exon I; an SEAP-encoding sequence (comprising a Kozak consensus 
translation initiating signal and an SEAP coding sequence) cloned into the mutated 
translational start site, Mfe I, of the murine E-selectin gene in exon II, a neoR cassette 
(comprising a loxP site; a neoR cassette; and a second loxP site) cloned into intron II; and 
and exons III, IV and V. A species of such a targeting construct has SEQ. ID. NO:10. 

Thus this invention provides a recombinant DNA construct comprising a 
polynucleotide having a nucleotide sequence as illustrated in SEQ ID NO:9 or SEQ ID 
NO:10. 

Well-established methods for germ-line or somatic insertion of a DNA construct 
include viral or non-viral vector-mediated gene transfer into fertilized eggs, zygotes or early 
embryos and/or a specific tissue (such as brain) in the adult animal, e.g., by gene transfer 
into embryonic stem cells, retroviral infection of early embryos or pronuclear microinjection. 

For example, gene transfer into embryonic stem cells may be carried out using a 
"knock-in" strategy performed, e.g., in two sequential steps: (1) by first generating a 
recombinant DNA construct (the "targeting construct") comprising a genomic clone of the 
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Mm E-selectin gene having inserted therein an exogenous genetic construct comprising the 

Wm soluble marker gene of interest ; and (2) by transfecting or otherwise inserting the 

W targeting construct into the embryonic stem cells under conditions suitable to effectuate 

' homologous recombination of the targeting construct with the endogenous, chromosomal 

5 E-selectin allele. 

Identification of successful embryonic stem cell clones may be by, e.g., using the 
Neo-loxP approach, Le^ by insertion of a Neo resistance cassette carrying the thymidine 
kinase promoter and Neo cDNA flanked by loxP sites, into an intron of the E-selectin gene. 
10 Successfully targeted ES-cell clones may then be transfected with a plasmid expressing 
the Cre-recombinase, thereby removing the neo-selection cassette (as well as one of the 
loxP sites in intron II). 

Using standard technology for the preparation of a transgenic animal ( e.g., a 
15 mouse), the insertion of the targeting construct is made in a single allele of the native E- 
selectin gene, and through genetic crossing, there can thereby be obtained a fertilized egg 
with the insertion in either a single E-selectin allele, i.e. heterozygous, or in both E-selectin 
alleles, i.e. homozygous. Further manipulation of resulting fertilized eggs, zygotes or early 
embryos and breeding of resulting transgenic founder animals follows established routes 
20 of breeding transgenic animals. Successful transgenic non-human animals may be 
identified, e.g., by fur colour, e.g., by the absence of fur colour. 

Accordingly, the targeted E-selectin gene ("targeted locus") of Example 1, following 
Cre-mediated neo cassette removal, is schematically depicted in Figure 2(c) hereof, and 

25 comprises, in order from 5' to 3': the native murine E-selectin promoter; a fragment of the 
native murine structural E-selectin gene comprising exon I; an SEAP-encoding component 
(comprising a Kozak consensus translation initiating signal, the SEAP coding sequence, 
and an SV-40 late mRNA polyadenyl-ation signal) cloned into the mutated translational 
start site, Mfe I, of the murine E-selectin gene in exon II; a single loxP site in intron II, and 

30 an additional fragments of the structural E-selectin gene comprising exons III, IV and V. A 
species of such a targeting construct has SEQ. ID. NO:1 1. 
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Likewise, the targeted E-selectin gene ("targeted locus") of Example 2, following 
Cre-mediated neo-cassette removal, is also schematically depicted in Figure 2(c), and 
comprises, in order from 5' to 3': the native murine E-selectin promoter; a fragment of the 
native murine structural E-selectin gene comprising exon I; an SEAP-encoding component 
(comprising a Kozak consensus translation initiating signal and, the SEAP coding 
sequence) cloned into the mutated translational start site, Mfe I, of the murine E-selectin 
gene in exon II; a single loxP site in intron II, and an additional fragments of the structural 
E-selectin gene comprising exons III, IV and V. A species of such a targeting construct 
has SEQ. ID. NO: 12. 



Thus this invention also provides a recombinant DNA construct comprising a 
polynucleotide having SEQ ID NO:1 1 or SEQ ID NO: 12, and a transgenic (or somatic 
recombinant) non-human animal comprising in its genome a nucleotide sequence having 
SEQ ID NO:1 1 or SEQ ID NO: 12. Most preferably, the invention consists of a transgenic 
15 knockout mouse comprising in its genome SEQ. ID. NO:11 or SEQ. ID. NO;12. 

The transgenic or somatic recombinant non-human animals of the invention are 
useful for readily assessing and non-invasively monitoring in a body fluid, e.g., in plasma, 
the expression of a marker for E-selectin. An advantage of this model is that the 

20 expression of a marker for an inflammatory, thrombotic, ischaemic or neoplastic condition 
can be followed in an easy and experimentally reproducible system, and time course 
analyses are possible in this model without killing the animals. For example, this model 
offers the possibility to assess the specific activation of endothelium in different disease 
models of e.g., inflammation, angiogenesis, atherosclerosis, thrombosis, and acute or 

25 chronic transplant rejection. The model can be used for profiling of compounds inhibiting 
specific signaling events related to the expression of E-selectin. For example, the model 
can be used to study the specific effect of inhibitors of the NFkB pathway. 

Models based on the transgenic or somatic recombinant non-human animals of the 
30 invention may be used for example to identify and assess the efficacy of potential 

therapeutic agents in disorders associated with inflammatory, thrombotic, ischaemic or 
neoplastic conditions. In particular such models may be used in screening or 
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characterization assays for detecting agents likely to, e.g., prevent or treat inflammation, 
angiogenesis, thrombosis and acute or chronic transplant rejection. 

Accordingly in a further aspect the invention comprises a method for testing a 
5 potential therapeutic agent for a specified condition, in particular a disorder associated with 
inflammatory, thrombotic, ischaemic or neoplastic conditions or transplant rejection, 
wherein the agent is administered to the transgenic non-human animal. Moreover the 
invention comprises a screening or characterization assay consisting in or including such a 
method. 

10 

Methods for testing potential therapeutic agents using animals are well known in 
the art. The transgenic non-human animals of the invention may be used in analogous 
manner. 

15 The effects of the potential therapeutic agent may be determined by administering 

the agent to a transgenic non-human animal, monitoring marker concentration in a body 
fluid, e.g. t plasma, and comparing the result with a result obtained from an untreated 
transgenic or somatic recombinant non-human animal. 

20 In a further embodiment the present invention is directed to a novel modulator of a 

regulatable protein identified by a screening assay comprising administering the potential 
modulator to a transgenic or somatic recombinant non-human animal, monitoring marker 
concentration in a body fluid, e.g.. plasma, and comparing the result with a result obtained 
from an untreated such animal. 

25 

Methods for monitoring marker concentration in a body fluid are well known in the 
art. For example, concentration of a marker enzyme may be determined by quantifying the 
enzymatic activity. Marker proteins without enzymatic activity may be quantified by other 
means e.g., by immunological methods. 

30 

The transgenic non-human animals are further useful to monitor disease 
development and progression in established models of disease involving spontanously 
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developing or induced disease conditions in wild-type or transgenic animals. For this, the 
transgenic non-human animals may be crossed with animal strains known for their use as 
animal models. Examples for suitable animal strains include mouse strains, e.g., 
atherosclerosis prone mice, e.g., targeted disruption of e.g., LDL-R or Apo E; mouse 
5 models for inflammation, e.g.. mouse strains showing defects in NFkB signalling, defects 
in TNF and TNF-R, IL-1 and IL-1-R, or mutations such as the MRL-lpr mouse; mouse 
strains with immune disorders, e.g., cytokine transgenic or knockout mice; Thrombosis- 
prone mice, e.g., plasminogen or plasminogen activator deficient mice; and mouse strains 
which develop tumors. Disease progression may be monitored by e.g., following marker 
10 gene expression in the F1 animals or in offsprings resulting from crossing in of the 

transgenic non-human animals for several generations into the respective disease model 
strain. 

The transgenic animals of the invention can also be used to study tissue 
15 distribution of E-selectin expression by various analytical methods including in situ 

histological analysis of the expression pattern of the reporter gene, and in vitro detection of 
the reporter gene in various tissues, tissue sections and/or cell types. 

The invention further contemplates isolated cells or a cell culture derived from a 
20 transgenic animal (e.g., mouse) of the invention. The cells can be obtained directly from 
the animal, from a descendent animal, or can be a progeny of a primary culture of one or 
more cells of the animal. The isolated cell can be in the form of a single cell or cell line, or 
a composition of mixed cells. The cell can be obtained from an animal which is the 
descendant of a transgenic animal of this invention, such as by a cross with another 
25 animal having either the same or a different genetic background. Thus an isolated cell can 
be either homozygous or heterozygous for a reporter gene in the E-selectin allele. The 
isolated cell is preferably an endothelial cell. 

In accordance with the foregoing the present invention thus provides 
30 (1 ) A transgenic or somatic recombinant non-human animal, e.g., a mammal, e.g., 

a pig, a primate, such as a monkey, or a rodent, such as a mouse or a rat, wherein 
expression of an exogenous soluble marker, e.g.. human growth hormone or a modified, 
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secreted heat stable alkaline phosphatase, specifically by endothelial cells, e.g., vascular 
endothelial cells, is regulatable by a chemical stimulus, including a cytokine, or a physical 
stimulus including light or temperature, e.g., a transgenic non-human animal comprising a 
polynucleotide encoding a soluble marker protein functionally linked to a regulatory 
5 sequence of an endogenous gene encoding E-selectin; and isolated cells of said animal. 

(2) A transgenic non-human animal wherein in the presence of a stimulus, e.g., an 
inflammatory cytokine, a soluble marker is specifically produced by endothelial cells. 

(3) A transgenic non-human animal comprising a recombinant DNA construct 
comprising a polynucleotide having a nucleotide sequence as illustrated in SEQ ID NO:11 

10 or SEQ ID NO:12. 

(4) A non-human animal being an offspring of a transgenic non-human animal as 
under (1), (2) or (3) crossed with a non-human animal showing a disease condition and/or 
its sequellae, e.g., symptoms, resulting from an altered genetic background, either 
spontaneously developed or following genetic manipulation. 

15 (5) A transgenic non-human animal cell comprising a recombinant DNA construct 

comprising a polynucleotide having a nucleotide sequence as illustrated in SEQ ID NO:11 
orSEQIDNO:12. 

(6) A recombinant DNA construct comprising a polynucleotide having a nucleotide 
sequence as illustrated in SEQ ID NO:9 or SEQ ID NO: 10. 

20 (7) A method for testing a potential therapeutic agent for a specified condition, in 

particular a disorder associated with inflammatory, thrombotic, ischaemic or neoplastic 
conditions or transplant rejection, wherein the agent is administered to a transgenic non- 
human animal as under (1), (2) or (3), monitoring marker concentration in a body fluid, 
e.g., plasma, and comparing the result with the result obtained from an untreated 

25 transgenic or somatic recombinant non-human animal as under (1 ), (2) or (3), an elevated 
level being indicative of the potential of the agent. 

(8) A method for screening of modulators of E-selectin expression comprising the 
steps of 

(a) administering the potential modulator to a transgenic or somatic recombinant 
30 non-human animal as under (1 ), (2) or (3), 

(b) monitoring marker concentration in a body fluid, e.g., plasma, and 
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(c) comparing the result with the result obtained from an untreated transgenic or 
somatic recombinant non-human animal as under (1), (2) or (3), a modulated level being 
indicative of the therapeutic potential of the agent. 

(9) A novel modulator of E-selectin expression identified by a method as under (6) 

5 or (7). 

(10) A method for treating a patient suffering from an inflammatory, thrombotic, 
ischaemic or neoplastic condition or from transplant rejection comprising administering to 
the patient a pharmaceutical^ effective amount of a modulator as under (8). 

(11) A method to monitor disease development and progression in established 
10 models of disease involving spontanously developing or induced disease conditions in 

wild-type or transgenic animals comprising 

(a) crossing of the transgenic non-human animals as under (1), (2) or (3) with 
animal strains known for their use as animal models and 

(b) monitoring disease progression in offsprings in relation to marker concentration 
15 in a body fluid. 

The following examples illustrate the invention without limitation. 



Example 1 Preparation of transgenic mice expressing a secreted heat stable 
20 alkaline phosphatase gene under the control of the E-selectin regulatory 

seguence without the 3'UTR of E-selectin containing the mRNA 
destabilizing AUUUA repeats 



(a) Preparation of DNA construct 

25 

(i) Mutation of the ATG start codon 

The ATG start site of E-selectin is mutated from ...GTC ATG AAT... to ... GTC 
AAT TG AAT... to introduce a unique Mfe I restriction site (underlined) within the genomic 
E-selectin fragment for cloning of SEAP, as follows: 
30 A -6kb Xbal genomic mouse E-selectin clone (D3-ES cells) in the pBS-KS(-) vector 

(Stratagene) is used to modify the ATG start codon by PCR. The 5-fragment is amplified 
by PCR using the primerpair SEQ ID NO:1 and SEQ ID NO:2 [50 nl standard reactions are 

13 

Case31176/A 



prepared using 400 nM primer, 200 nM dNTP's, PCR buffer with 1.5 mM MgCI 2 , 1.5 U Taq 
DNA polymerase in a PTC-200 thermocycler]. The 3'-fragment is amplified with the 
primerpair SEQ ID NO:3 and SEQ ID NO:4. The two fragments are purified using agarose 
gel electrophoresis. 200 fmol of each partial fragment is used for a PCR pre-amplification 
5 without primers to generate the full-length fragment (4x 94° 60"; 40° 120"; 72° 120"). The 
full-length fragment is amplified using primers SEQ ID NO:1 and SEQ ID NO:2 in a 
standard PCR reaction. The PCR fragment is gel purified, digested with Afl-2, the DNA 
precipitated using sodium acetate/ethanol, cut with BamH1 and purified by agarose 
electrophoresis. 

10 

(ii) mE-selectin with mutated ATG (mE-Sel/mutATG). 

Genomic Xbal/mE-Sel/pBS-KS(-) clone (is partially cut with BamH1, the 8.5kb 
fragment isolated using gel electrophoresis and digested with Afl-2. The 8 kb fragment is 
purified and ligated with the 520 bp PCR fragment obtained in (i) above carrying the 
15 mutated ATG start codon. The ligation mixture is precipitated (sodium acetate/ethanol) and 
redissolved in TE Buffer pH 8.0. 2 |il (20%) of the ligation is electroporated into E.coli XL1- 
blue electrocompetent cells and clones screened by PCR followed by precipitation and 
digestion. Several positive clones are verified by sequencing using standard protocol. 

20 (iii) mE-selectin with mutated ATG/Neo cassette (mE-Sel/mutATG/Neo). 

2.3 }ig plasmid mE-Sel/mutATG is linearised with Afl-2 and blunt ended with 
Klenow DNA polymerase using standard protocol and the gel purified vector is 
dephosphorylated with Shrimp alkaline phosphatase (SAP). The Neo cassette is isolated 
by digesting 5 ^ig plasmid pRay-2 [Storck et al., Nucleic Acids Res. 24:4594-4596 (1996)] 

25 with Xho-1 , the DNA precipitated cut with Xma-1 . The 1400 bp NeoloxP fragment is 
purified by gel electrophoresis and 0.5 ng of the isolated fragment is blunt ended using 
Klenow. 7 fmol Vector and 20 fmol NeoloxP fragment are ligated und electroporated into 
E. coli XL1 -blue, clones selected by PCR (SEQ ID NO:1 and SEQ ID NO:2). Several 
isolated plasmids are controlled by restriction digest (Sph1, Xba1, Nco1, BamH1) and for a 

30 3' -5' orientation of the NeoloxP insert. 



(iv) mE-Selectin with SEAP-2 in mutated ATG site (mE-Sel/SEAP-2/NeoV 
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5 ng plasmid mE-Sel/mutATG/Neo is linearized with Mfe-1, purified by gel 
electrophoresis and the isolated vector dephosphorylated with SAP. The SEAP-2 
fragment is isolated from 5 ng plasmid "pSEAP2 M (Accession No: U09660; Clontech) by 
EcoR1 and Mfe-1 digestion. The 1703 bp fragment is purified using gel electrophoresis. 5 
5 fmol vector and 20 fmol SEAP-2 are ligated and electroporated into E.coli XL1-blue as 
described above. Clones are selected by plasmid digestion using Nde-1; BamH1; Xba-1 
and the sequence of several positive clones is verified by DNA sequencing. Figure 1 
schematically depicts the insertion of the SEAP-2 fragment into linearized mE- 
Sel/mutATG/Neo. 

10 

v) Generation of targeting construct SEQ ID NO:9. 

6.6 jig plasmid mE-Sel/SEAP-2/Neo is linearized with BpulOI and the 6370bp 
fragment isolated by gel electrophoresis and blunt ended with Klenow. 2 \ig plasmid pBS2- 
SK(-) (Stratagene) is linearised with EcoRV, purified and dephosphorylated with SAP. 30 

15 fmol BpulOI fragment are ligated with 15 fmol vector pBS2SK and electroporated into 
E.coli XL 1 -blue. Clones are selected by Xba-1, BamH1 , and EcoR1 digestion. 120 jig 
endotoxin-free plasmid Tar-mE-Sel SEAP-2/Neo is linearised with Not-1, heat inactivated, 
precipitated and redissolved in 70 jJ TE Buffer low EDTA (O.lmmolar). The resulting 
targeting construct is a polynucleotide having SEQ. ID. NO:9. Figure 2(a) depicts the 

20 SEAP-2 targeting construct and its insertion site in the endogenous E-selectin locus. 

(b) Generation of E-selectin-SEAP knock-in BALB/c ES cell lines. 

5x1 0 6 BALB/c ES cells [Noben-Trauth et al.Jransgenic Res. 5:487-491 (1996)] are 
25 electroporated with 30 ng of the linearized targeting construct. Transfected cells are 

selected with G418 (200 ng/ml). Figure 2(b) schematically depicts the targeting construct 
as inserted into the targeted locus of the endogenous E-selectin gene. G418-resistant 
clones are screened for homologous recombination events by PCR [the ES cells are lysed 
1h/37° with 20 ^il Lysis buffer (PCR buffer 1X; SDS 1.7 jaM; Proteinase K 50 ^ig/ml) heat 
30 inactivated 85715 Min. and cleared by centrifugation. 1,3 jil lysed solution is used in for a 
50 ^l nested PCR. PCR (1): primer pair SEQ ID NO:5 and SEQ ID NO:6 (20 cycles) 
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followed by PCR (2): 1.3ul PCR (1) and primer pair SEQ ID NO:7 and SEQ ID NO:8 using 
25 cycles.] and positive clones are further verified by Southern analysis. 

Targeted ES lines are subjected to a further electroporation with a plasmid carrying 
5 the Cre gene under the transcriptional control of the HSV tk-promoter. Individual clones 
are grown in duplicate 96-well plates in the presence or absence of G418. The targeted 
locus after Cre-mediated neo cassette removal (leaving one loxP locus) is depicted in 
Figure 2(c). The absence of the neo selection cassette is verified by Southern blot 
analysis of individual G418 sensitive clones. 

10 

(c) Generation of E-selectin-SEAP knock-in mice 

BALB/c-ES cell clones carrying one E-selectin-SEAP allele are injected into BL/6-III 
host blastocysts and transferred into pseudopregnant foster mothers according to standard 
protocols. Chimeras are mated with BALB/c females and albino offspring (indicative for 
15 germ line transmission) are analyzed by PCR for target integration [200 ng genomic DNA 
are used in the same nested PCR as described in (vi) but in 30 nl reactions] and Southern 
analysis. Heterozygous animals are generated by back-crossing of F1 animals to Balb/c 
wild type animals and Southern analysis of the F2 animals. The homozygous lines are 
established by mating heterozygous F1 animals. 

20 

Example 2 Preparation of transgenic mice expressing a secreted heat stable alkaline 
phosphatase gene under the control of an E-selectin regulatory seguence 
maintaining the 3'UTR of E-selectin containing the mRNA destabilizing 
AUUUA repeats 

25 

(a) Preparation of DNA construct. 

fi) mE-Selectin with SEAP-1 in mutated ATG site (mE-Sel/SEAP-1/Neo). 
5 \xg plasmid obtained in Example 1(a)(ii) is cut with EcoR1 and Bsm-1 and the 
30 1615 bp fragment purified by gel electrophoresis. 45 fmol SEAP-1 fragment is ligated into 
5 fmol mE-Sel/mutATG/Neo, linearized with Mfe-1 and dephosphorylated, using 22 fmol 
Linker Bsm-1/Mfe-1 (GTTTAA). Figure 1 schematically depicts the insertion of the SEAP- 
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1 fragment into linearized mE-Sel/mutATG/Neo. Clone selection and sequencing are done 
as described above. The selected clone has a deletion of two base pairs (CA) in the Bsm1 
site located 40 bp 3 1 of the Stop codon. 

5 (ii) Generation of Targeting construct SEQ ID NO: 10. 

5 jig plasmid mE-Sel/SEAP-1/Neo are treated as described above. 120 ng 
endotoxin-free plasmid Tar-mE-Sel SEAP-1/Neo is linearised with Not-1, heat inactivated, 
precipitated and redissolved in TE buffer (0.1 mM EDTA). The resulting targeting construct 
is a polynucleotide having SEQ. ID. NO: 10. Figure 2(a) depicts the SEAP-1 targeting 
10 construct and its insertion site in the endogenous E-selectin locus. 

(b) Generation of E-selectin-SEAP knock-in BALB/c ES cell lines and mice. 

Cell lines and mice are generated following the protocol as described in Example 1 
(b) and (c) above. 

15 

Test 1 : LPS and cytokine induction of SEAP expression in heterozygous mice. 
Heterozygous mice of Example 1 and Example 2 are injected with LPS (Sigma, 
Serotype 0:1 1 1 B4 Cat L4130 ) at a dose of 25 ^ig per mouse. Murine recombinant TNFa 
(R&D Systems, Cat No. 410-MT) is diluted in sterile saline and administered at a dose of 
20 500 ng/mouse. Administration is intravenously, e.g., into the tail vein, subcutaneously or 
intraperitoneally. 

Blood for analysis is taken from a superficial vein of the mouse into heparinized 
glass hematocrit capillaries. The blood is centrifuged in a table-top Eppendorff centrifuge 
25 at maximal speed for 5 min to prepare plasma. 

A standard curve is created with human placental alkaline phosphatase (PAP) 
(Sigma Cat. P-3895). A stock solution of 2 mg/ml dissolved in buffer (Tris/HCI 50 mM pH 
7.4; NaCI 150 mM; BSA 1 %) is stored at -80°C in aliquots. The standard curve is diluted in 
30 mouse plasma containing Heparin. The range of the standard curve is between 0.5 and 
50000 ng/mL PAP. 
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For the enzyme assay the Tropix Kit {Tropix, Cat. No BP300) is used. The analysis 
of heat stable alkaline phosphatase activity in standards and samples is performed in 
duplicates according to the manufacturer's suggestion. 

5 The results are plotted in Figures 3 and 4. Figure 3 shows that animals of Example 1 

and Example 2 display significantly increased levels of circulating SEAP 6 hours after the 
administration of LPS as well as TNFa. The maximal levels of plasma SEAP are reached 
at 6 hours post-stimulation in animals of Example 2, the maximum reached in animals of 
Example 1 is not seen until 24 hours post stimulation. SEAP-2 mice express about 1000- 

10 fold more marker enzyme than SEAourP-1 animals. NaCI and IFNg (which are known not 
to induce E-selectin in vitro) have no effect on SEAP induction in either line, demonstrating 
that SEAP expression follows the same stimulation pattern as E-selectin. Figure 4 shows 
pooled data obtained at the 6 hour time point from n experiments. The plasma levels and 
the stimulation factors are shown (means +/- SEM). In the SEAP-1 line, LPS-or TNF- 

15 stimulated SEAP expression at 6 hours is higher than in the SEAP-2 line (about 10-fold vs. 
about 3-fold). 

Test 2: LPS and cytokine induction of SEAP expression in homozygous mice. 
The baseline plasma levels of SEAP is consistently and significantly higher in the 
20 homozygous compared to the heterozygous animals. The level of expression 6 hours after 
TNFa challenge is about 3-fold higher in the homozygous mice compared to the 
heterozygous animals of Example 1 and Example 2 while no change is seen in either line 
in saline-injected control animals. Figure 5 depicts a comparison of blood samples taken 
from heterozygous and homozygous mice treated with murine TNFa (500 ng/mouse) at 6 
25 hours after stimulation (means +/- SEM of n animals per group). 

Test 3: Analysis of the site of SEAP expression. 
Animals from Example 1 and Example 2 are treated with either LPS or saline, killed 
after 6 hours and organs removed for histology. Wild-type animals serve as controls. 

30 

After fixation for 3 hours in 4 % PBS-buffered paraformaldehyde, tissues are 
infused with sucrose over night, embedded in OCT compound and snap frozen. The 
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following results are obtained with tongue tissue. Cryosections (8 jam) are taken on 
polylysine coated slides, briefly dried and then stored at -80°C. For enzyme histochemistry, 
slides are brought to room temperature and then incubated for 45 min at 65°C in 50 mM 
Tris/100 mM NaCI. The sections are cooled to room temperature and further incubated in 
5 alkaline phosphatase substrate mixture prepared as follows: 100 mg nitro tetrazolium blue 
and 30 mg 5-bromo-4-chloro-3-indolyl phosphate are each dissolved in 5 ml N,N- 
dimethylformamide and then added to 90 ml 200 mM Tris buffer pH 9.2. The sections are 
incubated at 37°C for 3-4 h, briefly washed, counter-stained and inspected under a 
microscope. Immunohistochemistry is performed with anti CD31 and anti E-selectin 
10 antibody. 

In the non-heat inactivated tissue sections from mouse tongue very strong alkaline 
phosphatase activity is observed in several cell types. In the sections which were heat- 
inactivated, the background alkaline phosphatase activity is absent in the wild type control 

15 animals as well as in the SEAP-1 animals of Example 2 and, as shown in Figure 6, in the 
unstimulated SEAP-2 animal of Example 1 (boxes marked "SEAP-2 control"). In the LPS- 
treated animal of Example 1 (Figure 6, boxes marked "SEAP-2 LPS") a specific staining is 
seen in the endothelium of some but not all vessels. The stained vessels are the same as 
those which express E-selectin after LPS stimulation (se Figure 6, boxes marked "LPS"). 

20 Thus heat stable alkaline phosphatase enzyme activity is detected upon LPS stimulation in 
SEAP-2 mice in the same vessels as those which in wild type animals express E-selectin. 

Test 4: Elimination Kinetics of SEAP . The elimination kinetics of SEAP are 
investigated in an experiment in which wild-type mice are injected i.v. with human alkaline 
25 phosphatase, and plasma samples are analyzed. A conventional PK method is employed 
to analyze plasma concentration data and to estimate amount of SEAP expressed after 
stimulation, yielding the following data: 
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TNF LPS 



Animal Group 


Parameters 


mean 


SD 


mean 


SD 


SeAP-1 


AUC (ng h/mL) 


3250 


1040 


3990 


660 




Expression (ng) 


321 


102 


393 


65 


SeAP-2 


AUC (ng h/mL) 


2740 


700 


3930 


160 




Expression (? g) 


270 


69 


388 


16 



Endogenously produced SEAP is found to distribute mainly into the blood stream. 
As depicted in Figure 7, the elimination half-life of SEAP is about 10 hours. 

5 

Test 5: Effect of inhibitory compounds on SEAP expression. 
Clasto-lactacystine-p-lactone, a commercially available proteasome inhibitor is 
dissolved in DMSO (10 jnl/100|ig) and further diluted with 90 \il Peg300. This mixture is 
further diluted 1:10 in saline and injected i.v. to animals of Example 2 at a dose of 1 mg/kg. 

10 Ten to 15 min later TNF is administered i.p. to compound-treated or untreated animals of 
Example 2. Blood samples are taken from the animals at baseline, 6 hours and 24 hours 
and SEAP levels analyzed. A significant inhibition of marker enzyme expression is 
observed 6 hours after TNF administration in the animals which received clasto- 
lactacystine-p-lactone and TNF compared to the animals which received TNF alone 

15 (p=0.045, two-way ANOVA). Figure 8 shows that in SEAP-1 animals, the proteasome 
inhibitor significantly inhibits marker enzyme expression after TNF stimulation at 6 hours, 
but shows no effect at 24 hours in this mouse line. In SEAP-2 animals the compound does 
not modify SEAP expression at 6 hours or 24 hours after TNF simulation 

20 Test 6: Time course of endothelial activation during atherosclerotic lesion 

development. 

Homozygous mice derived from animals from Example 1 or Example 2 are mated 
to homozygous mouse strains prone to atherosclerosis, e.g., LDL-R -/- or apoE -/-. The 
offspring of these animals are fed a high cholesterol diet. At weekly intervals blood 
25 samples are taken and marker enzyme is quantified. Mice show an increase in marker 
enzyme prior to fatty lesion development. 
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Test 7: Endothelial cell activation during progression of inflammatory disease. 
Animals from Example 1 or Example 2 are crossed with mouse strains predisposed 
to spontaneous development of inflammatory disorders e.g.. the MRL-lpr mouse strain. 
Blood samples are taken from offspring at daily-to weekly time intervals and marker 
enzyme expression is monitored. The level of marker enzyme expression is correlated with 
disease onset. 
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SEQUENCE LISTING 



<110> Novartis AG 

5 <120> Transgenic animals for studying regulation of genes 

<130> SEAP mice 

<140> 
10 <141> 

<160> 12 

<170> Patentln Ver. 2.1 

15 

<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

20 

<220> 

<223> Description of Artificial Sequence -.primer 
<400> 1 

25 gcagtgctca gtggtcctcg a 21 

<210> 2 
<211> 27 
30 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
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<400> 2 

tcaattgact ccagcagttg ctttcat 27 
<210> 3 
5 <211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> Description of Artificial Sequence : primer 
<400> 3 

gaaagcaact gctggagtca attgaatgcc tcgcgctttc tc 42 

15 

<210> 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

20 

<220> 

<223> Description of Artificial Sequence : primer 
<400> 4 

25 gctaatcacc atataacgca c 21 



<210> 5 
<211> 28 
30 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
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<400> 5 

cctttccaac ttgcttcctt tatctgag 



28 



5 

<210> 6 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

10 

<220> 

<223> Description of Artificial Sequence : primer 
<400> 6 

15 agtttgtcct tcttctgccc ttttag 26 



<210> 7 
<211> 18 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer 



25 



<400> 7 

cctctccccg tggcatcc 18 



30 <210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 



<400> 8 

5 aagtccgggt tctcctcctc aa 22 

<210> 9 
<211> 6408 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : targeting 
sequence 

15 



20 



25 



30 



<400> 9 














cgataagctt 


gatttagcat 


ataatcagcg 


gtctaggtca 


ctaagtaggt 


actgtgtcac 


60 


tacagatacc 


atattaccac 


tgttgaaacg 


agaaatctag 


tatgaataga 


acatgagaag 


120 


aaacattcag 


gaatgactcc 


cctgcacccc 


agtgcctgtt 


aggatggcct 


aaggtagcag 


180 


tgactgagca 


gaaccttggc 


actgatgtac 


gtatcatcaa 


cagttatcca 


caagaaaatc 


240 


gagagtgaca 


gatcatgggt 


aataaaatac 


tcttagtttg 


ctcatctttg 


tgaaaccaag 


300 


ttctacactt 


gattgataat 


ggtaattaga 


ctttataaca 


caagacttaa 


catcacacac 


360 


ttgtatccca 


tttgataaaa 


ctgcatacaa 


atataagagc 


ccagaaaaag 


aattaattta 


420 


gaggggcaga 


gtctctgaca 


tcactatgaa 


agtgttttaa 


ctcagtggat 


attcccagaa 


480 


aactttttgg 


atgcagttga 


gaatttcctc 


ttagccagat 


tttgataata 


atgtcctatg 


540 


actcacagga 


agccagctcc 


cctataaaga 


ggctccagca 


gaagcagtgc 


tcagtggtcc 


600 


tcgactgact 


acacagcaaa 


actgcgagaa 


gaacggatag 


agagaagcag 


gagcaataca 


660 


cctaagggat 


ccaacgccag 


aacaacaatt 


ccactgaaca 


gtaagtgcgg 


attccgaacc 


720 


agaggagcct 


tgggcccgaa 


gcatagattt 


atgtgtgctc 


acatacgcaa 


aaaaaaaaaa 


780 


aaaaagtcat 


tctgattaaa 


aaagacctgt 


ctaaactgtt 


atagtgttgt 


aattaatacc 


840 


tacattttct 


ctttgtttaa 


aatgatggaa 


atcttaaagt 


ctgcatttat 


tctccatgct 


900 


agatgcctgt 


gaaataaggg 


atgaggtaca 


ggattctaaa 


acctcacagc 


tttgacttaa 


960 


atgctgttgt 


agaaggatgt 


gagtaacccg 


cacttagaat 


ttccttctgt 


gactaattct 


1020 


gcactttctc 


tcctcaggaa 


agtttctcca 


gtctagcgcc 


tggatgaaag 


caactgctgg 


1080 
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agtcaattcg cccaccatgc 
cctgggcatc atcccagttg 
ggccctgggt gccgccaaga 
cttcctgggc gatgggatgg 
5 gaagaaggac aaactggggc 
tctgtccaag acatacaatg 
ctacctgtgc ggggtcaagg 
taaccagtgc aacacgacac 
agcagggaag tcagtgggag 

10 cacctacgcc cacacggtga 
ccgccaggag gggtgccagg 
gatcctaggt ggaggccgaa 
agatgactac agccaaggtg 
ggcgaagcgc cagggtgccc 

15 ggacccgtct gtgacccatc 
ccaccgagac tccacactgg 
gctgagcagg aacccccgcg 
tcatcatgaa agcagggctt 
tgagagggcg ggccagctca 

20 ctcccacgtc ttctccttcg 
ccctggcaag gcccgggaca 
ctatgtgctc aaggacggcg 
gtatcggcag cagtcagcag 
ggtgttcgcg cgcggcccgc 

25 agcgcacgtc atggccttcg 
ccccgccggc accaccgacg 
cttcgagcag acatgataag 
tgaaaaaaat gctttatttg 
agctgcaata aacaagttaa 

30 tttttggtaa gctggagtca 
aaccggggta ctcgacctgc 
ctaataactt cgtatagcat 
ggaattcctc gacggatccg 
tattgccgat cccctcagaa 



tgctgctgct gctgctgctg 
aggaggagaa cccggacttc 
agctgcagcc tgcacagaca 
gggtgtctac ggtgacagct 
ctgagatacc cctggccatg 
tagacaaaca tgtgccagac 
gcaacttcca gaccattggc 
gcggcaacga ggtcatctcc 
tggtaaccac cacacgagtg 
accgcaactg gtactcggac 
acatcgctac gcagctcatc 
agtacatgtt tcgcatggga 
ggaccaggct ggacgggaag 
ggtatgtgtg gaaccgcact 
tcatgggtct ctttgagcct 
acccctccct gatggagatg 
gcttcttcct cttcgtggag 
accgggcact gactgagacg 
ccagcgagga ggacacgctg 
gaggctaccc cctgcgaggg 
ggaaggccta cacggtcctc 
cccggccgga tgttaccgag 
tgcccctgga cgaagagacc 
aggcgcacct ggttcacggc 
ccgcctgcct ggagccctac 
ccgcgcaccc gggttactct 
atacattgat gagtttggac 
tgaaatttgt gatgctattg 
caacaacaat tgaatgcctc 
cgggagacca gggactctca 
agccaagcta gcttggctgg 
acattatacg aagttatatt 
aacaaacgac ccaacacccg 
gaactcgtca agaaggcgat 

26 



ggcctgaggc 


tacagctctc 


1140 


tggaaccgcg 


aggcagccga 


1200 


gccgccaaga 


acctcatcat 


1260 


gccaggatcc 


taaaagggca 


1320 


gaccgcttcc 


catatgtggc 


1380 


agtggagcca 


cagccacggc 


1440 


ttgagtgcag 


ccgcccgctt 


1500 


gtgatgaatc 


gggccaagaa 


1560 


cagcacgcct 


cgccagccgg 


1620 


gccgacgtgc 


ctgcctcggc 


1680 


tccaacatgg 


acattgacgt 


1740 


accccagacc 


ctgagtaccc 


1800 


aatctggtgc 


aggaatggct 


1860 


gagctcatgc 


aggcttccct 


1920 


ggagacatga 


aatacgagat 


1980 


acagaggctg 


ccctgcgcct 


2040 


ggtggtcgca 


tcgaccatgg 


2100 


atcatgttcg 


acgacgccat 


2160 


agcctcgtca 


ctgccgacca 


2220 


agctccatct 


tcgggctggc 


2280 


ctatacggaa 


acggtccagg 


2340 


agcgagagcg 


ggagccccga 


2400 


cacgcaggcg 


aggacgtggc 


2460 


gtgcaggagc 


agaccttcat 


2520 


accgcctgcg 


acctggcgcc 


2580 


agagtcgggg 


cggccggccg 


2640 


aaaccacaac 


tagaatgcag 


2700 


ctttatttgt 


aaccattata 


2760 


gcgctttctc 


tctgctcttg 


2820 


ccctcacccc 


agtgactctt 


2880 


acgtaaactc 


ctcttcagac 


2940 


aagggttatt 


gaatatgatc 


3000 


tgcgttttat 


tctgtctttt 


3060 


agaaggcgat 


gcgctgcgaa 


3120 
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tcgggagcgg cgataccgta 
tcagcaatat cacgggtagc 
ccacagtcga tgaatccaga 
tcgccatggg tcacgacgag 
5 agttcggctg gcgcgagccc 
gcttccatcc gagtacgtgc 
gtagccggat caagcgtatg 
gcaggagcaa ggtgagatga 
tcccttcccg cttcagtgac 

10 agccacgata gccgcgctgc 
ttgacaaaaa gaaccgggcg 
ccgattgtct gttgtgccca 
cctgcgtgca atccatcttg 
tcgctcggtg ttcgaggcca 

15 ccgccccgac tgcatctgcg 
ctcgacattg ggtggaaaca 
aaaccacact gctcgacatt 
ttcctcttga aaaccacact 
cctcttcaga cctaataact 

20 tgaatatgat cggaattcct 
gcgttatatg gtgattagca 
aa 999t99 at tttaagtgct 
aacaggcacc acgtagcatc 
gttggaatat aacattgcgg 

25 atagaaattc tagccccttg 
caacaaaagg accctataaa 
tctctttctt gacccaatcc 
gcctccagtg agctcatgac 
catctggtgg caattcagaa 

30 tcaccgagtt actactggat 
acggggaagc ctctgacaga 
caaagaaatg aggactgtgt 
aatgacgaga gatgtaacaa 
catggtggaa ggctggctct 



aagcacgagg aagcggtcag 
caacgctatg tcctgatagc 
aaagcggcca ttttccacca 
atcctcgccg tcgggcatgc 
ctgatgctct tcgtccagat 
tcgctcgatg cgatgtttcg 
cagccgccgc attgcatcag 
caggagatcc tgccccggca 
aacgtcgagc acagctgcgc 
ctcgtcctgc agttcattca 
cccctgcgct gacagccgga 
gtcatagccg aatagcctct 
ttcaatggcc gatcccatgg 
cacgcgtcac cttaatatgc 
tgttcgaatt cgccaatgac 
ttccaggcct gggtggagag 
gggtggaaac attccaggcc 
gctcgacctg cagccaagct 
tcgtatagca tacattatac 
cgattaagag caggatattt 
gaactgatga cctccctgca 
ttgtgaagag tcgttcatct 
caatgaagtt tgctttctct 
ttgccataag gcaccaccag 
agatgatcct cttcttactt 
ccaaaagcaa gacaacacga 
agttctcctc gctggagaga 
gtatgatgaa gccagtgcat 
caaggaagag atcaactacc 
tggaatcaga aaagtcaata 
ggaagctcag aactgggctc 
agagatttac atccaacgaa 
aaagaagctg gctctgtgct 
gtgtggaggc agcctgacag 

27 



cccattcgcc 


gccaagctct 


3180 


ggtccgccac 


acccagccgg 


3240 


tgatattcgg 


caagcaggca 


3300 


gcgccttgag 


cctggcgaac 


3360 


catcctgatc 


gacaagaccg 


3420 


cttggtggtc 


gaatgggcag 


3480 


ccatgatgga 


tactttctcg 


3540 


cttcgcccaa 


tagcagccag 


3600 


aaggaacgcc 


cgtcgtggcc 


3660 


gggcaccgga 


caggtcggtc 


3720 


acacggcggc 


atcagagcag 


3780 


ccacccaagc 


ggccggagaa 


3840 


taaaaaccct 


cctcgcaggg 


3900 


gaagtggacc 


tgggaccgcg 


3960 


aagacgctgg 


gcggggtttg 


4020 


gctttttgct 


tcctcttgca 


4080 


tgggtggaga 


ggctttttgc 


4140 


agcttggctg 


gacgtaaact 


4200 


gaagttatat 


taagggttat 


4260 


tgttgtgtgg 


ggctttctgt 


4320 


gggttctgcc 


ttgtggtggg 


4380 


ggctcctgaa 


a gggg9ctca 


4440 


gacttccatt 


gtactgtgag 


4500 


ataatggcac 


acggtgatat 


4560 


ttaaaatcaa 


tagtttgaac 


4620 


ttcaggacta 


aactttgcta 


4680 


gcacagcttg 


gtactacaat 


4740 


actgtcagcg 


ggactacaca 


4800 


ttaactccaa 


tctgaaacat 


4860 


acgtatggat 


ctgggtgggg 


4920 


caggtgaacc 


aaacaacaaa 


4980 


ccaaagactc 


gggcatgtgg 


5040 


acacaggtat 


gaagtttctg 


5100 


attgagagtt 


gtataaagga 


5160 
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attggtctct actagttttt 
ctcttagtac tgatttgtgt 
tatgtactta tatgtataca 
taaaaataaa atatgtggtt 
5 tatacaacag atgagactgg 
tgtcattaga gtcatctgga 
agcaccatac taattgactg 
tgcatcctgc agtggtcatg 
ccaccctggc ttcctgggac 

10 tctccaagat ggtaacaccc 
cccatgtgtt tcctctactg 
ctcctgggtc aaactgcaga 
cctgtagggt gactgacgat 
gctgaagctt agcgttttag 

15 ttccatagag cctggaccac 
cctctctcag atgcaaaggg 
gatttttcag ctgtgacttg 
tcccacccgt tcggcccctt 
tacctgccca gcagcatgga 

20 cctgctccag cctgccatgg 
tgccttaatc gaattcctgc 



tcaaagagat gattataagt 
gagtagaggc aagatatata 
tataataatt ctatcaatat 
tagttgtagc tattctgatg 
aacttgtatc tggcatgcca 
ttgcactgat tctaaattac 
tcctcatatg gactgtgttt 
gtgaatgcat agagaccatc 
ccaactgtga gcaaggtaag 
tctcctcccc cagatggctt 
agtaagagtg atatagtggc 
cctactaccc gtattactta 
tccagttgtc atgaaactaa 
ttccagagaa cgacaaagaa 
aggaaatgtt caatgggtga 
agagatgagc agacgtgtgt 
caaaccacag gaacaccctg 
cagctataat tcctcctgct 
gaccaccgtg cggtgtacgt 
taactctccc aatgcagtaa 
agcccggggg atccactagt 



gattttaata gtaattgcat 5220 
catatatgtg tgtgtgtatg 5280 
acagagtata tatatttgca 5340 
tgtatttatt accaccaaaa 5400 
atcaaaatct ccagcactaa 5460 
aaacaaacaa actcatgatg 5520 
tctgtagctt cgtgtaccaa 5580 
aatagttaca cctgcaagtg 5640 
tcttgtcctc acctgtttct 5700 
gtgatggctt gtcagtcggg 5760 
tgagactatg agatcagatc 5820 
tggcaccttg acaaagtagt 5880 
ataacatcat gtggcaatta 5940 
actgtttcat agtctgatgg 6000 
taattatggt gaccagcgtt 6060 
cttgattcgt tgttttgttt 6120 
actatggaag cctgaactgc 6180 
cctttggctg taaaaggggc 6240 
cctctggaga gtggagtgcg 6300 
acctcttcac tcctcctcat 6360 
tctagagc 6408 



<210> 10 
25 <211> 6314 
<212> DNA 

<213> Artificial Sequence 
<220> 

30 <223> Description of Artificial Sequence : targeting 
sequence 

<400> 10 

cgataagctt gatttagcat ataatcagcg gtctaggtca ctaagtaggt actgtgtcac 60 

28 
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tacagatacc atattaccac 
aaacattcag gaatgactcc 
tgactgagca gaaccttggc 
gagagtgaca gatcatgggt 
5 ttctacactt gattgataat 
ttgtatccca tttgataaaa 
gaggggcaga gtctctgaca 
aactttttgg atgcagttga 
actcacagga agccagctcc 

10 tcgactgact acacagcaaa 
cctaagggat ccaacgccag 
agaggagcct tgggcccgaa 
aaaaagtcat tctgattaaa 
tacattttct ctttgtttaa 

15 agatgcctgt gaaataaggg 
atgctgttgt agaaggatgt 
gcactttctc tcctcaggaa 
agtcaattcg cccaccatgc 
cctgggcatc atcccagttg 

20 ggccctgggt gccgccaaga 
cttcctgggc gatgggatgg 
gaagaaggac aaactggggc 
tctgtccaag acatacaatg 
ctacctgtgc ggggtcaagg 

25 taaccagtgc aacacgacac 
agcagggaag tcagtgggag 
cacctacgcc cacacggtga 
ccgccaggag gggtgccagg 
gatcctaggt ggaggccgaa 

30 agatgactac agccaaggtg 
ggcgaagcgc cagggtgccc 
ggacccgtct gtgacccatc 
ccaccgagac tccacactgg 
gctgagcagg aacccccgcg 



tgttgaaacg agaaatctag 
cctgcacccc agtgcctgtt 
actgatgtac gtatcatcaa 
aataaaatac tcttagtttg 
ggtaattaga ctttataaca 
ctgcatacaa atataagagc 
tcactatgaa agtgttttaa 
gaatttcctc ttagccagat 
cctataaaga ggctccagca 
actgcgagaa gaacggatag 
aacaacaatt ccactgaaca 
gcatagattt atgtgtgctc 
aaagacctgt ctaaactgtt 
aatgatggaa atcttaaagt 
atgaggtaca ggattctaaa 
gagtaacccg cacttagaat 
agtttctcca gtctagcgcc 
tgctgctgct gctgctgctg 
aggaggagaa cccggacttc 
agctgcagcc tgcacagaca 
gggtgtctac ggtgacagct 
ctgagatacc cctggccatg 
tagacaaaca tgtgccagac 
gcaacttcca gaccattggc 
gcggcaacga ggtcatctcc 
tggtaaccac cacacgagtg 
accgcaactg gtactcggac 
acatcgctac gcagctcatc 
agtacatgtt tcgcatggga 
ggaccaggct ggacgggaag 
ggtatgtgtg gaaccgcact 
tcatgggtct ctttgagcct 
acccctccct gatggagatg 
gcttcttcct cttcgtggag 
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tatgaataga 


acatgagaag 


120 


aggatggcct 


aaggtagcag 


180 


cagttatcca 


caagaaaatc 


240 


ctcatctttg 


tgaaaccaag 


300 


caagacttaa 


catcacacac 


360 


ccagaaaaag 


aattaattta 


420 


ctcagtggat 


attcccagaa 


480 


tttgataata 


atgtcctatg 


540 


gaagcagtgc 


tcagtggtcc 


600 


agagaagcag 


gagcaataca 


660 


gtaagtgcgg 


attccgaacc 


720 


acatacgcaa 


aaaaaaaaaa 


780 


atagtgttgt 


aattaatacc 


840 


ctgcatttat 


tctccatgct 


900 


acctcacagc 


tttgacttaa 


960 


ttccttctgt 


gactaattct 


1020 


tggatgaaag 


caactgctgg 


1080 


ggcctgaggc 


tacagctctc 


1140 


tggaaccgcg 


aggcagccga 


1200 


gccgccaaga 


acctcatcat 


1260 


gccaggatcc 


taaaagggca 


1320 


gaccgcttcc 


catatgtggc 


1380 


agtggagcca 


cagccacggc 


1440 


ttgagtgcag 


ccgcccgctt 


1500 


gtgatgaatc 


gggccaagaa 


1560 


cagcacgcct 


cgccagccgg 


1620 


gccgacgtgc 


ctgcctcggc 


1680 


tccaacatgg 


acattgacgt 


1740 


accccagacc 


ctgagtaccc 


1800 


aatctggtgc 


aggaatggct 


1860 


gagctcatgc 


aggcttccct 


1920 


ggagacatga 


aatacgagat 


1980 


acagaggctg 


ccctgcgcct 


2040 


ggtggtcgca 


tcgaccatgg 


2100 
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tcatcatgaa agcagggctt 
tgagagggcg ggccagctca 
ctcccacgtc ttctccttcg 
ccctggcaag gcccgggaca 
5 ctatgtgctc aaggacggcg 
gtatcggcag cagtcagcag 
ggtgttcgcg cgcggcccgc 
agcgcacgtc atggccttcg 
ccccgccggc accaccgacg 

10 cttcgagcag acatgataag 
tgcctcgcgc tttctctctg 
ctctcaccct caccccagtg 
ggctggacgt aaactcctct 
tatattaagg gttattgaat 

15 cacccgtgcg ttttattctg 
ggcgatagaa ggcgatgcgc 
ggtcagccca ttcgccgcca 
gatagcggtc cgccacaccc 
ccaccatgat attcggcaag 

20 gcatgcgcgc cttgagcctg 
ccagatcatc ctgatcgaca 
gtttcgcttg gtggtcgaat 
catcagccat gatggatact 
ccggcacttc gcccaatagc 

25 ctgcgcaagg aacgcccgtc 
cattcagggc accggacagg 
gccggaacac ggcggcatca 
gcctctccac ccaagcggcc 
ccatggtaaa aaccctcctc 

30 atatgcgaag tggacctggg 
aatgacaaga cgctgggcgg 
ggagaggctt tttgcttcct 
caggcctggg tggagaggct 
caagctagct tggctggacg 



accgggcact gactgagacg 
ccagcgagga ggacacgctg 
gaggctaccc cctgcgaggg 
ggaaggccta cacggtcctc 
cccggccgga tgttaccgag 
tgcccctgga cgaagagacc 
aggcgcacct ggttcacggc 
ccgcctgcct ggagccctac 
ccgcgcaccc gggttactct 
atacattgat gagtttggac 
ctcttgtttt tggtaagctg 
actcttaacc ggggtactcg 
tcagacctaa taacttcgta 
atgatcggaa ttcctcgacg 
tctttttatt gccgatcccc 
tgcgaatcgg gagcggcgat 
agctcttcag caatatcacg 
agccggccac agtcgatgaa 
caggcatcgc catgggtcac 
gcgaacagtt cggctggcgc 
agaccggctt ccatccgagt 
gggcaggtag ccggatcaag 
ttctcggcag gagcaaggtg 
agccagtccc ttcccgcttc 
gtggccagcc acgatagccg 
tcggtcttga caaaaagaac 
gagcagccga ttgtctgttg 
ggagaacctg cgtgcaatcc 
gcagggtcgc tcggtgttcg 
accgcgccgc cccgactgca 
ggtttgctcg acattgggtg 
cttgcaaaac cacactgctc 
ttttgcttcc tcttgaaaac 
taaactcctc ttcagaccta 
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atcatgttcg 


acgacgccat 


2160 


agcctcgtca 


ctgccgacca 


2220 


agctccatct 


tcgggctggc 


2280 


ctatacggaa 


acggtccagg 


2340 


agcgagagcg 


ggagccccga 


2400 


cacgcaggcg 


aggacgtggc 


2460 


gtgcaggagc 


agaccttcat 


2520 


accgcctgcg 


acctggcgcc 


2580 


agagtcgggg 


cggccggccg 


2640 


aaaccacaac 


tagaatggaa 


2700 


gagtcacggg 


agaccaggga 


2760 


acctgcagcc 


aagctagctt 


2820 


tagcatacat 


tatacgaagt 


2880 


gatccgaaca 


aacgacccaa 


2940 


tcagaagaac 


tcgtcaagaa 


3000 


accgtaaagc 


acgaggaagc 


3060 


ggtagccaac 


gctatgtcct 


3120 


tccagaaaag 


cggccatttt 


3180 


gacgagatcc 


tcgccgtcgg 


3240 


gagcccctga 


tgctcttcgt 


3300 


acgtgctcgc 


tcgatgcgat 


3360 


cgtatgcagc 


cgccgcattg 


3420 


agatgacagg 


agatcctgcc 


3480 


agtgacaacg 


tcgagcacag 


3540 


cgctgcctcg 


tcctgcagtt 


3600 


cgggcgcccc 


tgcgctgaca 


3660 


tgcccagtca 


tagccgaata 


3720 


atcttgttca 


atggccgatc 


3780 


aggccacacg 


cgtcacctta 


3840 


tctgcgtgtt 


cgaattcgcc 


3900 


gaaacattcc 


aggcctgggt 


3960 


gacattgggt 


ggaaacattc 


4020 


cacactgctc 


gacctgcagc 


4080 


ataacttcgt 


atagcataca 


4140 
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ttatacgaag ttatattaag 
atattttgtt gtgtggggct 
cctgcagggt tctgccttgt 
tcatctggct cctgaaaggg 
5 ttctctgact tccattgtac 
caccagataa tggcacacgg 
ttacttttaa aatcaatagt 
acacgattca ggactaaact 
gagagagcac agcttggtac 

10 gtgcatactg tcagcgggac 
actaccttaa ctccaatctg 
tcaataacgt atggatctgg 
gggctccagg tgaaccaaac 
aacgaaccaa agactcgggc 

15 tgtgctacac aggtatgaag 
tgacagattg agagttgtat 
ataagtgatt ttaatagtaa 
tatatacata tatgtgtgtg 
caatatacag agtatatata 

20 ctgatgtgta tttattacca 
atgccaatca aaatctccag 
aattacaaac aaacaaactc 
gtgttttctg tagcttcgtg 
accatcaata gttacacctg 

25 ggtaagtctt gtcctcacct 
tggcttgtga tggcttgtca 
agtggctgag actatgagat 
tacttatggc accttgacaa 
aactaaataa catcatgtgg 

30 aaagaaactg tttcatagtc 
gggtgataat tatggtgacc 
gtgtgtcttg attcgttgtt 
accctgacta tggaagcctg 
cctgctcctt tggctgtaaa 



ggttattgaa tatgatcgga 
ttctgtgcgt tatatggtga 
ggtgggaagg gtggatttta 
ggctcaaaca ggcaccacgt 
tgtgaggttg gaatataaca 
tgatatatag aaattctagc 
ttgaaccaac aaaaggaccc 
ttgctatctc tttcttgacc 
tacaatgcct ccagtgagct 
tacacacatc tggtggcaat 
aaacattcac cgagttacta 
gtggggacgg ggaagcctct 
aacaaacaaa gaaatgagga 
atgtggaatg acgagagatg 
tttctgcatg gtggaaggct 
aaaggaattg gtctctacta 
ttgcatctct tagtactgat 
tgtatgtatg tacttatatg 
tttgcataaa aataaaatat 
ccaaaatata caacagatga 
cactaatgtc attagagtca 
atgatgagca ccatactaat 
taccaatgca tcctgcagtg 
caagtgccac cctggcttcc 
gtttcttctc caagatggta 
gtcgggccca tgtgtttcct 
cagatcctcc tgggtcaaac 
agtagtcctg tagggtgact 
caattagctg aagcttagcg 
tgatggttcc atagagcctg 
agcgttcctc tctcagatgc 
ttgtttgatt tttcagctgt 
aactgctccc acccgttcgg 
aggggctacc tgcccagcag 
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attcctcgat 


taagagcagg 


4200 


ttagcagaac 


tgatgacctc 


4260 


agtgctttgt 


gaagagtcgt 


4320 


agcatccaat 


gaagtttgct 


4380 


ttgcggttgc 


cataaggcac 


4440 


cccttgagat 


gatcctcttc 


4500 


tataaaccaa 


aagcaagaca 


4560 


caatccagtt 


ctcctcgctg 


4620 


catgacgtat 


gatgaagcca 


4680 


tcagaacaag 


gaagagatca 


4740 


ctggattgga 


atcagaaaag 


4800 


gacagaggaa 


gctcagaact 


4860 


ctgtgtagag 


atttacatcc 


4920 


taacaaaaag 


aagctggctc 


4980 


ggctctgtgt 


ggaggcagcc 


5040 


gttttttcaa 


agagatgatt 


5100 


ttgtgtgagt 


agaggcaaga 


5160 


tatacatata 


ataattctat 


5220 


gtggtttagt 


tgtagctatt 


5280 


gactggaact 


tgtatctggc 


5340 


tctggattgc 


actgattcta 


5400 


tgactgtcct 


catatggact 


5460 


gtcatggtga 


atgcatagag 


5520 


tgggacccaa 


ctgtgagcaa 


5580 


acaccctctc 


ctcccccaga 


5640 


ctactgagta 


agagtgatat 


5700 


tgcagaccta 


ctacccgtat 


5760 


gacgattcca 


gttgtcatga 


5820 


ttttagttcc 


agagaacgac 


5880 


gaccacagga 


aatgttcaat 


5940 


aaagggagag 


atgagcagac 


6000 


gacttgcaaa 


ccacaggaac 


6060 


ccccttcagc 


tataattcct 


6120 


catggagacc 


accgtgcggt 


6180 
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gtacgtcctc tggagagtgg agtgcgcctg ctccagcctg ccatggtaac 
cagtaaacct cttcactcct cctcattgcc ttaatcgaat tcctgcagcc 
actagttcta gage 



tctcccaatg 6240 
egggggatec 6300 
6314 



<210> 11 
<211> 5083 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :DNA construct 



<400> 11 

15 ttagcatata ateageggtc 
ttaccactgt tgaaacgaga 
tgactcccct gcaccccagt 
ccttggcact gatgtacgta 
catgggtaat aaaatactct 

20 tgataatggt aattagactt 
gataaaactg catacaaata 
tctgacatca ctatgaaagt 
cagttgagaa tttcctctta 
cagctcccct ataaagaggc 

25 cagcaaaact gcgagaagaa 
acgccagaac aacaattcca 
gcccgaagca tagatttatg 
gattaaaaaa gacctgtcta 
tgtttaaaat gatggaaatc 

30 ataagggatg aggtacagga 
aggatgtgag taacccgcac 
tcaggaaagt ttctccagtc 
accatgctgc tgetgetget 
ccagttgagg aggagaaccc 



taggtcacta agtaggtact 
aatctagtat gaatagaaca 
gcctgttagg atggcctaag 
tcatcaacag ttatccacaa 
tagtttgetc atctttgtga 
tataacacaa gacttaacat 
taagagecca gaaaaagaat 
gttttaactc agtggatatt 
gecagatttt gataataatg 
tccagcagaa geagtgetea 
eggatagaga gaagcaggag 
ctgaacagta agtgcggatt 
tgtgctcaca tacgeaaaaa 
aactgttata gtgttgtaat 
ttaaagtctg catttattct 
ttctaaaacc tcacagcttt 
ttagaatttc cttctgtgac 
tagegectgg atgaaagcaa 
getgetggge ctgaggctac 
ggacttctgg aaccgegagg 
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gtgtcactac agataccata 60 
tgagaagaaa cattcaggaa 120 
gtagcagtga ctgagcagaa 180 
gaaaatcgag agtgacagat 240 
aaccaagttc tacacttgat 300 
cacacacttg tatcccattt 360 
taatttagag gggcagagtc 420 
cccagaaaac tttttggatg 480 
tcctatgact cacaggaagc 540 
gtggtcctcg actgactaca 600 
caatacacct aagggatcca 660 
ccgaaccaga ggagccttgg 720 
aaaaaaaaaa aagtcattct 780 
taatacctac attttctctt 840 
ecatgetaga tgcctgtgaa 900 
gacttaaatg ctgttgtaga 960 
taattctgea ctttctctcc 1020 
ctgctggagt caattcgccc 1080 
agctctccct gggcatcatc 1140 
cagccgaggc cctgggtgcc 1200 
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gccaagaagc tgcagcctgc 
gggatggggg tgtctacggt 
ctggggcctg agatacccct 
tacaatgtag acaaacatgt 
5 gtcaagggca acttccagac 
acgacacgcg gcaacgaggt 
gtgggagtgg taaccaccac 
acggtgaacc gcaactggta 
tgccaggaca tcgctacgca 

10 ggccgaaagt acatgtttcg 
caaggtggga ccaggctgga 
ggtgcccggt atgtgtggaa 
acccatctca tgggtctctt 
acactggacc cctccctgat 

15 ccccgcggct tcttcctctt 
agggcttacc gggcactgac 
cagctcacca gcgaggagga 
tccttcggag gctaccccct 
cgggacagga aggcctacac 

20 gacggcgccc ggccggatgt 
tcagcagtgc ccctggacga 
ggcccgcagg cgcacctggt 
gccttcgccg cctgcctgga 
accgacgccg cgcacccggg 

25 tgataagata cattgatgag 
ttatttgtga aatttgtgat 
aagttaacaa caacaattga 
99 a gtcacgg 9 a 9 a ccaggg 
gacctgcagc caagctagct 

30 atagcataca ttatacgaag 
taagagcagg atattttgtt 
tgatgacctc cctgcagggt 
gaagagtcgt tcatctggct 
gaagtttgct ttctctgact 



acagacagcc gccaagaacc 
gacagctgcc aggatcctaa 
ggccatggac cgcttcccat 
gccagacagt ggagccacag 
cattggcttg agtgcagccg 
catctccgtg atgaatcggg 
acgagtgcag cacgcctcgc 
ctcggacgcc gacgtgcctg 
gctcatctcc aacatggaca 
catgggaacc ccagaccctg 
cgggaagaat ctggtgcagg 
ccgcactgag ctcatgcagg 
tgagcctgga gacatgaaat 
ggagatgaca gaggctgccc 
cgtggagggt ggtcgcatcg 
tgagacgatc atgttcgacg 
cacgctgagc ctcgtcactg 
gcgagggagc tccatcttcg 
ggtcctccta tacggaaacg 
taccgagagc gagagcggga 
agagacccac gcaggcgagg 
tcacggcgtg caggagcaga 
gccctacacc gcctgcgacc 
ttactctaga gtcggggcgg 
tttggacaaa ccacaactag 
gctattgctt tatttgtaac 
atgcctcgcg ctttctctct 
actctcaccc tcaccccagt 
tggctggacg taaactcctc 
ttatattaag ggttattgaa 
gtgtggggct ttctgtgcgt 
tctgccttgt ggtgggaagg 
cctgaaaggg ggctcaaaca 
tccattgtac tgtgaggttg 
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tcatcatctt 


cctgggcgat 


1260 


aagggcagaa 


gaaggacaaa 


1320 


atgtggctct 


gtccaagaca 


1380 


ccacggccta 


cctgtgcggg 


1440 


cccgctttaa 


ccagtgcaac 


1500 


ccaagaaagc 


agggaagtca 


1560 


cagccggcac 


ctacgcccac 


1620 


cctcggcccg 


ccaggagggg 


1680 


ttgacgtgat 


cctaggtgga 


1740 


agtacccaga 


tgactacagc 


1800 


aatggctggc 


gaagcgccag 


1860 


cttccctgga 


cccgtctgtg 


1920 


acgagatcca 


ccgagactcc 


1980 


tgcgcctgct 


gagcaggaac 


2040 


accatggtca 


tcatgaaagc 


2100 


acgccattga 


gagggcgggc 


2160 


ccgaccactc 


ccacgtcttc 


2220 


ggctggcccc 


tggcaaggcc 


2280 


gtccaggcta 


tgtgctcaag 


2340 


gccccgagta 


tcggcagcag 


2400 


acgtggcggt 


gttcgcgcgc 


2460 


ccttcatagc 


gcacgtcatg 


2520 


tggcgccccc 


cgccggcacc 


2580 


ccggccgctt 


cgagcagaca 


2640 


aatgcagtga 


aaaaaatgct 


2700 


cattataagc 


tgcaataaac 


2760 


gctcttgttt 


ttggtaagct 


2820 


gactcttaac 


cggggtactc 


2880 


ttcagaccta 


ataacttcgt 


2940 


tatgatcgga 


attcctcgat 


3000 


tatatggtga 


ttagcagaac 


3060 


gtggatttta 


agtgctttgt 


3120 


ggcaccacgt 


agcatccaat 


3180 


gaatataaca 


ttgcggttgc 


3240 
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cataaggcac caccagataa tggcacacgg tgatatatag aaattctagc cccttgagat 3300 

gatcctcttc ttacttttaa aatcaatagt ttgaaccaac aaaaggaccc tataaaccaa 3360 

aagcaagaca acacgattca ggactaaact ttgctatctc tttcttgacc caatccagtt 3420 

ctcctcgctg gagagagcac agcttggtac tacaatgcct ccagtgagct catgacgtat 3480 

5 gatgaagcca gtgcatactg tcagcgggac tacacacatc tggtggcaat tcagaacaag 3540 

gaagagatca actaccttaa ctccaatctg aaacattcac cgagttacta ctggattgga 3600 

atcagaaaag tcaataacgt atggatctgg gtggggacgg ggaagcctct gacagaggaa 3660 

gctcagaact gggctccagg tgaaccaaac aacaaacaaa gaaatgagga ctgtgtagag 3720 

atttacatcc aacgaaccaa agactcgggc atgtggaatg acgagagatg taacaaaaag 3780 

10 aagctggctc tgtgctacac aggtatgaag tttctgcatg gtggaaggct ggctctgtgt 3840 

ggaggcagcc tgacagattg agagttgtat aaaggaattg gtctctacta gttttttcaa 3900 

agagatgatt ataagtgatt ttaatagtaa ttgcatctct tagtactgat ttgtgtgagt 3960 

agaggcaaga tatatacata tatgtgtgtg tgtatgtatg tacttatatg tatacatata 4020 

ataattctat caatatacag agtatatata tttgcataaa aataaaatat gtggtttagt 4080 

15 tgtagctatt ctgatgtgta tttattacca ccaaaatata caacagatga gactggaact 4140 

tgtatctggc atgccaatca aaatctccag cactaatgtc attagagtca tctggattgc 4200 

actgattcta aattacaaac aaacaaactc atgatgagca ccatactaat tgactgtcct 4260 

catatggact gtgttttctg tagcttcgtg taccaatgca tcctgcagtg gtcatggtga 4320 

atgcatagag accatcaata gttacacctg caagtgccac cctggcttcc tgggacccaa 4380 

20 ctgtgagcaa ggtaagtctt gtcctcacct gtttcttctc caagatggta acaccctctc 4440 

ctcccccaga tggcttgtga tggcttgtca gtcgggccca tgtgtttcct ctactgagta 4500 

agagtgatat agtggctgag actatgagat cagatcctcc tgggtcaaac tgcagaccta 4560 

ctacccgtat tacttatggc accttgacaa agtagtcctg tagggtgact gacgattcca 4620 

gttgtcatga aactaaataa catcatgtgg caattagctg aagcttagcg ttttagttcc 4680 

25 agagaacgac aaagaaactg tttcatagtc tgatggttcc atagagcctg gaccacagga 4740 

aatgttcaat gggtgataat tatggtgacc agcgttcctc tctcagatgc aaagggagag 4800 

atgagcagac gtgtgtcttg attcgttgtt ttgtttgatt tttcagctgt gacttgcaaa 4860 

ccacaggaac accctgacta tggaagcctg aactgctccc acccgttcgg ccccttcagc 4920 

tataattcct cctgctcctt tggctgtaaa aggggctacc tgcccagcag catggagacc 4980 

30 accgtgcggt gtacgtcctc tggagagtgg agtgcgcctg ctccagcctg ccatggtaac 5040 

tctcccaatg cagtaaacct cttcactcct cctcattgcc tta 5083 

<210> 12 
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<211> 4989 
<212> DNA 

<213> Artificial Sequence 



5 <220> 

<223> Description of Artificial Sequence : DNA construct 



<400> 12 





ttagcatata 


atcagcggtc 


taggtcacta 


agtaggtact 


gtgtcactac 


agataccata 


60 


10 


ttaccactgt 


tgaaacgaga 


aatctagtat 


gaatagaaca 


tgagaagaaa 


cattcaggaa 


120 




tgactcccct 


gcaccccagt 


gcctgttagg 


atggcctaag 


gtagcagtga 


ctgagcagaa 


180 




ccttggcact 


gatgtacgta 


tcatcaacag 


ttatccacaa 


gaaaatcgag 


agtgacagat 


240 




catgggtaat 


aaaatactct 


tagtttgctc 


atctttgtga 


aaccaagttc 


tacacttgat 


300 




tgataatggt 


aattagactt 


tataacacaa 


gacttaacat 


cacacacttg 


tatcccattt 


360 


15 


gataaaactg 


catacaaata 


taagagccca 


gaaaaagaat 


taatttagag 


gggcagagtc 


420 




tctgacatca 


ctatgaaagt 


gttttaactc 


agtggatatt 


cccagaaaac 


tttttggatg 


480 




cagttgagaa 


tttcctctta 


gccagatttt 


gataataatg 


tcctatgact 


cacaggaagc 


540 




cagctcccct 


ataaagaggc 


tccagcagaa 


gcagtgctca 


gtggtcctcg 


actgactaca 


600 




cagcaaaact 


gcgagaagaa 


cggatagaga 


gaagcaggag 


caatacacct 


aagggatcca 


660 


20 


acgccagaac 


aacaattcca 


ctgaacagta 


agtgcggatt 


ccgaaccaga 


ggagccttgg 


720 




gcccgaagca 


tagatttatg 


tgtgctcaca 


tacgcaaaaa 


aaaaaaaaaa 


aagtcattct 


780 




gattaaaaaa 


gacctgtcta 


aactgttata 


gtgttgtaat 


taatacctac 


attttctctt 


840 




tgtttaaaat 


gatggaaatc 


ttaaagtctg 


catttattct 


ccatgctaga 


tgcctgtgaa 


900 




ataagggatg 


aggtacagga 


ttctaaaacc 


tcacagcttt 


gacttaaatg 


ctgttgtaga 


960 


25 


aggatgtgag 


taacccgcac 


ttagaatttc 


cttctgtgac 


taattctgca 


ctttctctcc 


1020 




tcaggaaagt 


ttctccagtc 


tagcgcctgg 


atgaaagcaa 


ctgctggagt 


caattcgccc 


1080 




accatgctgc 


tgctgctgct 


gctgctgggc 


ctgaggctac 


agctctccct 


gggcatcatc 


1140 




ccagttgagg 


aggagaaccc 


ggacttctgg 


aaccgcgagg 


cagccgaggc 


cctgggtgcc 


1200 




gccaagaagc 


tgcagcctgc 


acagacagcc 


gccaagaacc 


tcatcatctt 


cctgggcgat 


1260 


30 


gggatggggg 


tgtctacggt 


gacagctgcc 


aggatcctaa 


aagggcagaa 


gaaggacaaa 


1320 




ctggggcctg 


agatacccct 


ggccatggac 


cgcttcccat 


atgtggctct 


gtccaagaca 


1380 




tacaatgtag 


acaaacatgt 


gccagacagt 


ggagccacag 


ccacggccta 


cctgtgcggg 


1440 




gtcaagggca 


acttccagac 


cattggcttg 


agtgcagccg 


cccgctttaa 


ccagtgcaac 


1500 




acgacacgcg 


gcaacgaggt 


catctccgtg 


atgaatcggg 


ccaagaaagc 


agggaagtca 


1560 
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gtgggagtgg taaccaccac 
acggtgaacc gcaactggta 
tgccaggaca tcgctacgca 
ggccgaaagt acatgtttcg 
5 caaggtggga ccaggctgga 
ggtgcccggt atgtgtggaa 
acccatctca tgggtctctt 
acactggacc cctccctgat 
ccccgcggct tcttcctctt 

10 agggcttacc gggcactgac 
cagctcacca gcgaggagga 
tccttcggag gctaccccct 
cgggacagga aggcctacac 
gacggcgccc ggccggatgt 

15 tcagcagtgc ccctggacga 
ggcccgcagg cgcacctggt 
gccttcgccg cctgcctgga 
accgacgccg cgcacccggg 
tgataagata cattgatgag 

20 ctctctgctc ttgtttttgg 
cccagtgact cttaaccggg 
ctcctcttca gacctaataa 
attgaatatg atcggaattc 
gtgcgttata tggtgattag 

25 ggaagggtgg attttaagtg 
caaacaggca ccacgtagca 
aggttggaat ataacattgc 
atatagaaat tctagcccct 
accaacaaaa ggaccctata 

30 tatctctttc ttgacccaat 
atgcctccag tgagctcatg 
cacatctggt ggcaattcag 
attcaccgag ttactactgg 
ggacggggaa gcctctgaca 



acgagtgcag cacgcctcgc 
ctcggacgcc gacgtgcctg 
gctcatctcc aacatggaca 
catgggaacc ccagaccctg 
cgggaagaat ctggtgcagg 
ccgcactgag ctcatgcagg 
tgagcctgga gacatgaaat 
ggagatgaca gaggctgccc 
cgtggagggt ggtcgcatcg 
tgagacgatc atgttcgacg 
cacgctgagc ctcgtcactg 
gcgagggagc tccatcttcg 
ggtcctccta tacggaaacg 
taccgagagc gagagcggga 
agagacccac gcaggcgagg 
tcacggcgtg caggagcaga 
gccctacacc gcctgcgacc 
ttactctaga gtcggggcgg 
tttggacaaa ccacaactag 
taagctggag tcacgggaga 
gtactcgacc tgcagccaag 
cttcgtatag catacattat 
ctcgattaag agcaggatat 
cagaactgat gacctccctg 
ctttgtgaag agtcgttcat 
tccaatgaag tttgctttct 
ggttgccata aggcaccacc 
tgagatgatc ctcttcttac 
aaccaaaagc aagacaacac 
ccagttctcc tcgctggaga 
acgtatgatg aagccagtgc 
aacaaggaag agatcaacta 
attggaatca gaaaagtcaa 
gaggaagctc agaactgggc 

36 



cagccggcac 


ctacgcccac 


1620 


cctcggcccg 


ccaggagggg 


1680 


ttgacgtgat 


cctaggtgga 


1740 


agtacccaga 


tgactacagc 


1800 


aatggctggc 


gaagcgccag 


1860 


cttccctgga 


cccgtctgtg 


1920 


acgagatcca 


ccgagactcc 


1980 


tgcgcctgct 


gagcaggaac 


2040 


accatggtca 


tcatgaaagc 


2100 


acgccattga 


gagggcgggc 


2160 


ccgaccactc 


ccacgtcttc 


2220 


ggctggcccc 


tggcaaggcc 


2280 


gtccaggcta 


tgtgctcaag 


2340 


gccccgagta 


tcggcagcag 


2400 


acgtggcggt 


gttcgcgcgc 


2460 


ccttcatagc 


gcacgtcatg 


2520 


tggcgccccc 


cgccggcacc 


2580 


ccggccgctt 


cgagcagaca 


2640 


aatggaatgc 


ctcgcgcttt 


2700 


ccagggactc 


tcaccctcac 


2760 


ctagcttggc 


tggacgtaaa 


2820 


acgaagttat 


attaagggtt 


2880 


tttgttgtgt 


ggggctttct 


2940 


cagggttctg 


ccttgtggtg 


3000 


ctggctcctg 


aaagggggct 


3060 


ctgacttcca 


ttgtactgtg 


3120 


agataatggc 


acacggtgat 


3180 


ttttaaaatc 


aatagtttga 


3240 


gattcaggac 


taaactttgc 


3300 


gagcacagct 


tggtactaca 


3360 


atactgtcag 


cgggactaca 


3420 


ccttaactcc 


aatctgaaac 


3480 


taacgtatgg 


atctgggtgg 


3540 


tccaggtgaa 


ccaaacaaca 


3600 
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aacaaagaaa tgaggactgt gtagagattt 
ggaatgacga gagatgtaac aaaaagaagc 
tgcatggtgg aaggctggct ctgtgtggag 
gaattggtct ctactagttt tttcaaagag 
5 atctcttagt actgatttgt gtgagtagag 
tgtatgtact tatatgtata catataataa 
cataaaaata aaatatgtgg tttagttgta 
aatatacaac agatgagact ggaacttgta 
aatgtcatta gagtcatctg gattgcactg 

10 tgagcaccat actaattgac tgtcctcata 
aatgcatcct gcagtggtca tggtgaatgc 
tgccaccctg gcttcctggg acccaactgt 
cttctccaag atggtaacac cctctcctcc 
ggcccatgtg tttcctctac tgagtaagag 

15 tcctcctggg tcaaactgca gacctactac 
gtcctgtagg gtgactgacg attccagttg 
tagctgaagc ttagcgtttt agttccagag 
ggttccatag agcctggacc acaggaaatg 
ttcctctctc agatgcaaag ggagagatga 

20 ttgatttttc agctgtgact tgcaaaccac 
gctcccaccc gttcggcccc ttcagctata 
gctacctgcc cagcagcatg gagaccaccg 
cgcctgctcc agcctgccat ggtaactctc 
attgcctta 

25 



acatccaacg aaccaaagac tcgggcatgt 3660 
tggctctgtg ctacacaggt atgaagtttc 3720 
gcagcctgac agattgagag ttgtataaag 3780 
atgattataa gtgattttaa tagtaattgc 3840 
gcaagatata tacatatatg tgtgtgtgta 3900 
ttctatcaat atacagagta tatatatttg 3960 
gctattctga tgtgtattta ttaccaccaa 4020 
tctggcatgc caatcaaaat ctccagcact 4080 
attctaaatt acaaacaaac aaactcatga 4140 
tggactgtgt tttctgtagc ttcgtgtacc 4200 
atagagacca tcaatagtta cacctgcaag 4260 
gagcaaggta agtcttgtcc tcacctgttt 4320 
cccagatggc ttgtgatggc ttgtcagtcg 4380 
tgatatagtg gctgagacta tgagatcaga 4440 
ccgtattact tatggcacct tgacaaagta 4500 
tcatgaaact aaataacatc atgtggcaat 4560 
aacgacaaag aaactgtttc atagtctgat 4620 
ttcaatgggt gataattatg gtgaccagcg 4680 
gcagacgtgt gtcttgattc gttgttttgt 4740 
aggaacaccc tgactatgga agcctgaact 4800 
attcctcctg ctcctttggc tgtaaaaggg 4860 
tgcggtgtac gtcctctgga gagtggagtg 4920 
ccaatgcagt aaacctcttc actcctcctc 4980 

4989 
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